Lymphocyte subset phenotyping for the prediction of progression to inflammatory arthritis in anti-citrullinated-peptide antibody-positive at-risk individuals

Abstract Objectives Inflammatory arthritis (IA) is considered the last stage of a disease continuum, where features of systemic autoimmunity can appear years before clinical synovitis. Time to progression to IA varies considerably between at-risk individuals, therefore the identification of biomarkers predictive of progression is of major importance. We previously reported on the value of three CD4+T cell subsets as biomarkers of progression. Here, we aim to establish the value of 18 lymphocyte subsets (LS) for predicting progression to IA. Methods Participants were recruited based on a new musculoskeletal complaint and being positive for anti-citrullinated-peptide antibody. Progression (over 10 years) was defined as the development of clinical synovitis. LS analysis was performed for lymphocyte lineages, naive/memory subsets, inflammation-related cells (IRC) and regulatory cells (Treg/B-reg). Modelling used logistic/Cox regressions. Results Of 210 patients included, 93 (44%) progressed to IA, 41/93 (44%) within 12 months (rapid progressors). A total of 5/18 LS were associated with progression [Treg/CD4-naïve/IRC (adjusted P < 0.0001), CD8 (P = 0.021), B-reg (P = 0.015)] and three trends (NK-cells/memory-B-cells/plasmablasts). Unsupervised hierarchical clustering using these eight subsets segregated three clusters of patients, one cluster being enriched [63/109(58%)] and one poor [10/45(22%)] in progressors. Combining all clinical and LS variables, forward logistic regression predicted progression with accuracy = 85.7% and AUC = 0.911, selecting smoking/rheumatoid-factor/HLA-shared-epitope/tender-joint-count-78 and Treg/CD4-naive/CD8/NK-cells/B-reg/plasmablasts. To predict rapid progression, a Cox regression was performed resulting in a model combining smoking/rheumatoid factor and IRC/CD4-naive/Treg/NK-cells/CD8+T cells (AUC = 0.794). Conclusion Overall, progression was predicted by specific LS, suggesting potential triggers for events leading to the development of IA, while rapid progression was associated with a different set of subsets.


Introduction
RA is a chronic autoimmune, inflammatory joint disease.A pre-clinical phase of RA has been identified, also known as the at-risk phase of the inflammatory arthritis continuum (IAC) [1,2].The at-risk phase can last up to 15 years, during which genetic and environmental factors contribute to the progression including a break in tolerance and the development of systemic autoimmunity manifested by the presence of autoantibodies (particularly, anti-citrullinated peptide antibodies-ACPA).The development of pain and other musculoskeletal symptoms in the absence of synovitis precedes a final stage when synovitis develops.Treatment is conventionally initiated upon the detection of clinical synovitis.
Given the success of treatment for early RA [3], predicting an individual's progression to IA/RA may enable preventive interventions.However, the rate of progression to IA is $30-40% depending on the criteria used to identify at-risk individuals [4,5].ACPA and/or rheumatoid factor (RF) are widely

Rheumatology key messages
• Novel predictive value is associated with B-reg, NK-cells-CD56 bright and CD8þT-cells in addition to Treg/naive-CD4þT-cells.
• Different lymphocyte subset profiles are associated with clusters of patients with different IA outcomes.
• Overall and short-term progression are predicted by different cell subsets and clinical variables.
The involvement of immune cells in RA pathogenesis has been extensively described (T/B/NK-cells, monocytes [8][9][10][11][12][13]).We established the predictive value of the frequencies of circulating naive CD4þT cells, regulatory T cells (Tregs), and a subset of naive cells (CD62L-) able to enter IL6-expressing tissues (called inflammation-related cells IRC) for RA diagnosis, for the induction of remission at first treatment, flares in patients in clinical remission and ability to safely discontinue anti-TNF [14][15][16].In ACPAþ at-risk individuals, these three CD4þT-cell subsets showed good predictive value as biomarkers for progression to IA, both individually and when combined with clinical variables [15,17].
The specific cellular and molecular events that influence progression to the next stage of the IAC [18] remain unclear.Here, we test the hypothesis that the dysregulation of other lymphocyte subsets (LS), in addition to the three CD4þT-cell subsets previously reported [15,17], may provide mechanistic clues as to the cellular events underpinning the progression to IA.Furthermore, we proposed to investigate whether an extended LS analysis (performed by hospital services on fresh blood over a few hours), can provide an improvement of the performances of current prediction models [15,17] that are using only three CD4þT-cell subsets in combination with demographic and clinical data.

Study cohort
Patients have been recruited in the Coordinated Programme to Prevent Arthritis register since 2008.Ethical approval was obtained (REC approval: 06/Q1205/169) and all participants provided informed written consent.
Recruitment and follow-up have been described in a previous publication [6].To sum up, recruitment criteria were a new non-specific musculoskeletal complaint and ACPA (detected by a routine hospital test) or RF positivity or as a relative to patients with RA, but in the absence of clinical synovitis.Participants were followed until IA occurred.Progression is defined by the development of inflammatory arthritis (clinical synovitis) i.e. !1 swollen joint and evaluated by a senior rheumatologist.Patients with non-progression were included only if they had >12 months of follow-up (details in supplementary material, available at Rheumatology online).Data collected included demographic (age, gender), lifestyle (smoking, alcohol), genetic (shared epitope, HLA-SE), physical assessment (78 joint tenderness, early morning stiffness), serology (RF) and inflammatory markers (erythrocyte sedimentation rate, C-reactive protein).
Patients were selected from the register (which recruited over 500 participants) if they had flow cytometry performed (see below) and if they were ACPA-positivity by a second generation CCP-2 test (Immuno-CAP, Phadia, Sweden, positivity cut-off at 10 OD), performed in our research lab as false positivity with the hospital test was recently described [19], and irrespective of positivity for RF.
All patients included in this study (n ¼ 210), had data for the CD4þT cells panel (naive, IRC) the lymphocyte count panel (LS), and the Treg panels' missingness is described.Some were excluded due to technical issues (poor quality of blood due to transport delays), while for the Treg panel, a shortage of one antibody (FoxP3) led to the panel not being done.For the CD8 panel, delays in transport were associated with difficulties in gating the CD8-IRC subset.The B-cell panel was introduced late (2015), hence limiting us to 210 patients among all participants in the register.Altogether, 150/210 patients had a full dataset and we imputed data for individual subsets in 41 patients with >15/18 LS present.We inputted data for the B-panel in 19 patients who had complete data for all other LS.
LS were quantified by flow cytometry using five panels for lineage: CD4þ/CD8þ T-cell subsets (naive, memory, IRCs); Treg; B-cell subsets (naive, memory, plasmablasts, B-reg).Detailed methodology is described in supplementary material (available at Rheumatology online) and previously detailed [15].All blood samples were processed fresh within a few hours of the collection after transport from the clinics to the central NHS immunology laboratory in Leeds.The gating strategies are illustrated in Supplementary Fig. S1, available at Rheumatology online.LS frequencies were reported as the percentage of the parental population.Age relationships for specific cell subsets were previously described [20] and normalisation was applied [15].

Statistical analysis
A Spearman correlation-based clustering algorithm (Cluster-3, Stanford University 1998-99) was applied after log transformation to assess collinearity between LS frequencies.A heat-map was generated using TreeView.
Exploration of data was performed for univariate analysis using Mann-Whitney U and v 2 tests, between progressors and non-progressors groups.Correction for multiple testing was applied in univariate analysis using Bonferroni correction [21,22].Variables were assessed individually for predictive value using the unadjusted odd ratio (OR, 95%Cl) and area under the ROC curve (AUC, 95%Cl).Logistic and Cox regression modelling was used to predict progression using a stepwise forward method [23], selecting the best combination of variables for significant improvement of the fit.Missing data imputation was performed using the tool for multiple imputation process in SPSS setting up the boundaries of data to be inputted and then the 'Imputing Missing Data Values' function over five cycles.The estimates obtained from each dataset (five imputation datasets) were aggregated to produce an overall imputation estimate using the same SPSS package.The pooled dataset after imputation was compared with the non-imputed dataset to verify that ORs were not affected by the imputation process.Pooled analysis was performed, and ORs were not affected by the imputation process.Sensitivity, specificity and positive/negative predictive values (PPV/NPV) were calculated from the classification matrix of accurately predicted cases obtained from logistic regression.A total of 14% of cases had one or another missing data, while we confirmed that data were missing at random using Little's MCAR test (P < 0.0001).WALD tests from the regression analyses performed were used to assess the contribution of each of our predictor variables.
A bootstrapping technique was used (500 permutations) to calculate a discrimination index correcting for optimism between the predicted and actual outcome for the best regression models.
We used graphical diagnostics based on the scaled Schoenfeld residuals to examine the proportional assumption Prediction of progression to inflammatory arthritis in at-risk individuals for the Cox regression.This allowed us to assess whether the hazard functions remained proportional over time.

Cohort outcome
ACPAþ patients (n ¼ 210) with a minimum of 12 months (up to 10 years) of follow-up were selected from the overall cohort.Clinical data were retrieved and progression to clinical synovitis was observed in 93/210 (44%) of participants, occurring under 12 months in 41 patients (rapid progression), within 13-24 months for 18 patients, with the last 33 progressing later than 2 years post inclusion and one patient after 10 years (Supplementary Fig. S2, available at Rheumatology online).A total of 75% of progressors met the EULAR 2010 Classification criteria for RA at the time of progression with an average of >3.5 swollen joints (range 1-15) and were directed to our early arthritis clinic for further care.
Demographic/clinical data are described in Table 1.Association between progression and data at inclusion suggested two highly significant parameters (HLA-SE/RF, MWU P < 0.0001 after correction) and another three potential associations (smoking/EMS/TJC78, MWU P < 0.05), consistent with published reports in this cohort [6].AUCs were calculated suggesting high values for RF/HLA-SE (AUC > 0.650, P < 0.0001) and for smoking/TJC78/EMS (AUC > 0.600, P < 0.010).The individual contribution to the prediction was however relatively small for all parameters with 19% for RF (Wald test) and >8% for the other three variables.Of note, no difference in ACPA levels were detected between progressors (mean 335 OD) or non-progressors (341 OD) as well as between rapid (355 OD) and delayed (344 OD) progressors.

Flow cytometry analysis and univariate association with progression
The overall results of LS phenotyping (Fig. 1, Table 2) showed significant association with progression for five subsets: lower naive CD4þT cells, Treg (P < 0.0001) and CD8þT cells (P ¼ 0.021), and higher CD4-IRC (P < 0.0001) and B-reg (P ¼ 0.015).Three more subsets, higher NK-cells CD56 bright , higher memory B cells and lower plasmablasts showed non-significant difference (after correction) that may nonetheless suggest other biological events leading to progression.
There was only one significant correlation between two subsets (naive B cells/memory B cells, rho ¼ À0.885, P < 0.0001).

Clusters of patients
To define groups of patients with similar LS profiles, we used hierarchical clustering, while not specifying the outcome (unsupervised).A pilot analysis using heat-map of frequencies (Supplementary Fig. S3, available at Rheumatology online), showed three distinct distributions of the LS.Each group contained one of the highly predictive subsets, suggesting that these were dysregulated independently of each other.The analysis also segregated patients into three clusters with different proportions of progressors (P < 0.0001) suggesting that different LS profiles can discriminate patients with different outcomes.
We repeated this approach limiting data to the eight subsets identified above.This analysis distributed LS in four groups and patients in three clusters (Fig. 2).The first subset LS group (purple) included only plasmablasts, the second (orange) included naive CD4þT, B-reg and NK-CD56 bright cells; the third (pink) combined memory B cells, Treg and CD8þT cells; the last group (blue) with IRC-CD4 alone.Cluster-I was defined by high Treg and high naive CD4 but very low IRC-CD4 and was mainly composed of non-progressors 47/60 (78%).Cluster-II was mainly driven by high plasmablasts while Tregs were also high and CD4þIRC low.This profile was associated with a mixed outcome of 17/41 (42%) progressors and 24/41 (58%) non-progressors.In the largest cluster-III, IRC-CD4 were particularly high and all other LS showed mixed patterns defining subgroups of patients.The proportion of progressors in cluster III was 58% (63/109), which was significantly different from the other two clusters (P < 0.0001).b Tests adjusted for 10 comparisons (adjustment of the P-value was performed by applying Bonferroni correction method for multiple comparison test).AUROC: area under the roc curve; EMS: early morning stiffness; HLA (SE): human leucocyte antigen (shared epitope); TJC: tender joint count.
There was no difference in any demographic or clinical data between these three clusters of patients.We, however, observed higher levels of ACPA (P ¼ 0.012) in cluster III (CCP2-test: mean 220 OD) compared with cluster I (158 OD,) but not with cluster II (243 OD, P ¼ 0.177).No association was seen between plasmablasts and ACPA (or RF) levels.

Multivariate modelling for the prediction of progression to IA
We previously reported predictive value for three CD4þT cell subsets (naive/IRC/Treg) [15].Modelling using an enter approach in these 210 patients (as in previous work,  Model-1 selected four parameters in a stepwise construction, starting with RF, then sequentially adding HLA-SE, TJC78 and smoking.This model accurately predicted 70% of cases, with SEN/SPE ¼ 56%/81.2%and a good NPV/ PPV ¼ 70%, with an AUC ¼ 0.744 (Fig. 3A).However, only 25% of the variance for predicting progression was accounted for (Nagelkerke R-square) and individual variable contribution was 17% for RF and <7% for the other three (individual Wald score).
We further applied internal validation to model-3 to correct for optimism using a bootstrapping technique (500 permutations), resulting in an optimised Dxy-value ¼ 0.458.We ran a calibration and observed no overfitting of the bootstrapping technique (slopes was close to 1; Supplementary Fig. S5, available at Rheumatology online).
The high PPV/NPV of model-3 (77%/86%) suggests that it is possible to predict individuals who are likely to progress while identifying those who have low risk and may be monitored less often (even discharged).In this cohort, using dichotomisation based on the probability to progress calculated for each patient in model-3, 65/210 could be deemed high-risk (Fig. 3B, i.e. probability >80%) and all but six progressed (91% accurate).Alternatively, 78/210 cases were low-risk (i.e.probability <20%) and only six (7.7%) progressed over 10 years.

Modelling rapid progression to IA
Time to progression is widely distributed in this cohort ranging from 1-120 months.Different LS associations may therefore be involved at different stages of the progression, and some may be more predictive of the onset of IA symptoms than others.A total of 41/93 (44%) of the progressors did so rapidly and we re-analysed these progressors using a Cox regression.
The assumption or proportional hazard was verified for all variables in model 6 (Supplementary Fig. S4, available at Rheumatology online).Cox regression models were constructed using the same forward approach (Table 3).Un-adjusted hazard ratio (HR) for time to progression was significant for four clinical variables (smokers/HLA-SE/RF/EMS, CRP showing a trend) and for six LS (Treg/naive CD4þT cells/CD4-IRC/CD8þT cells/NK-CD56 dim /B-reg).
Being able to identify individuals at high risk of rapid progression would allow for the design of a clinical trial aiming at the prevention of progression within a short trial duration of only 12 months.In this group, 22 participants were dichotomised based on a high-hazard (individual X-beta score value > 2) for rapid progression in model-6, and 20 (91%) progressed (Fig. 3D).

Applicability for daily practice
Flow cytometry is routinely used in Leeds and worldwide.We therefore evaluated the gain in terms of accurate stratification of adding LS panels (cost/time/technology) over only using demographic/clinical data.Individual participants' probability of progression was dichotomised (based on a cut-off at 80% specificity) into high/low risk groups, for the demographic/ clinical data then sequentially adding 1-4 flow-cytometry panels (Supplementary Table S3, available at Rheumatology online).
The stratification showed 70% accuracy for the reference model (Fig. 3E).Adding the Treg panel increased accuracy to 75.2%.Adding the naive CD4þT cells panel showed further accuracy ¼ 80.4% while adding CD8þ/NK-cells (lineage panel) and then B-reg (B-cell panel) only achieved a marginal improvement of accuracy (to 81%/81.5%,respectively).The AUC, however, was still improving with every incremental step and all four panels added value.
A similar analysis for imminent progression suggested that only three panels would be needed while the performances clearly improved with the three steps (Supplementary Table S4, available at Rheumatology online).

Discussion
Our study demonstrated the value of extensive phenotyping of LS to predict progression to IA in ACPAþ at-risk individuals.The clustering analysis suggested that progressors are associated with specific LS profiles, cluster-III dominated by high CD4-IRC and cluster-II by high plasmablasts/Treg.Nonprogressors (cluster-I) were characterised by high Treg/CD4naive and low plasmablasts/CD4-IRC.These profiles were not associated with any particular demographic/clinical data, suggesting that they are independently regulated from any genetic/environmental or inflammatory events.We, therefore, confirmed previous data for CD4þT-cell subsets and further identified LS associated with progression, providing clues to the identity of cells (CD8/NK/B-cells) involved in the events triggering or associated with progression.Importantly, different subsets were retained in the modelling for overall compared with imminent progression, suggesting a time frame for different biological roles/triggers.Overall, our data confirm that wider LS dysregulation precedes the development of Innocent Anioke et al. a Normalised frequency.AUC: area under the roc curve; EMS: early morning stiffness; Exp-: expanded; HLA (SE): human leucocyte antigen (shared epitope); HR: hazard ratio; IRC: inflammatory-related cells; NK: natural killer; NKT: natural killer-T; NPV: negative predictive value; OR: odds ratio; PPV: positive predictive value; TJC: tender joint count; Treg: regulatory T cells.clinical synovitis, while also providing increased accuracy over previous models with only CD4þT-cell subsets.

Prediction of progression to inflammatory arthritis in at-risk individuals
Our findings showed an increase in circulating B-reg frequencies paralleling a reduction in Treg, suggesting an additional role for regulatory mechanisms before the onset of clinical synovitis.Functionally defective B-regs have been associated with homing to synovitis [24] while other studies have associated inflammation with the expansion of B-regs [25,26].In new-onset RA, the failure of B-reg to maintain a functionally suppressive Treg population was demonstrated [24,27].It is therefore conceivable that a decrease in Treg (loss of tolerance), paralleled with an increase in B-regs (subclinical inflammation) are both associated with progression to IA.On the other hand, our findings do not exclude that Bregs may be functionally defective [28,29] or were excluded from the synovium (i.e.circulating), limiting their ability to perform their role locally.Future work will be needed to determine which hypothesis may be correct.
Consistent with our original work [15,17], a reduction in circulating naive and Treg CD4þT cells frequencies and an increase in IRC-CD4 predates the development of IA (AUC ¼ 0.790, n ¼ 102).Since this original work, a standardised normalisation procedure for the naive/memory and Treg subsets was established [15,17], allowing for analysis of different phase-specific outcomes across the IAC using continuous data.Applying this to this group of 210 patients provided further validation of the original three CD4þT-cell subset model using the same logistic regression approach based on entering all variables in the model (enter method); however, it suggested that IRC-CD4 were no longer independently contributing to the prediction in this larger group (OR ¼ 1.104, P ¼ 0.144).Modelling using a forward method allowing for the best predictors only (data not shown, accuracy ¼ 78.6%, AUC ¼ 0.880) confirmed that IRC were less predictive of the overall progression over 10 years, while still highly associated with rapid progression over 12 months, as also observed in the clustering (Fig. 2) where most of the rapid progressors [28/41 (68%)] were in Cluster III defined by higher CD4þIRC.
We have associated many defects in the naive CD4þT-cell subset with early RA pathogenesis, notably in relation to a decline in thymic activity, aberrant signalling and aberrant proliferation reducing their TREC (T-cell receptor excision circle) content by >50% (1-2 cell cycle) [13], impaired IL7 responsiveness [30] and recently an IL6-driven network of epigenetic modifications suggesting the development of a subpopulation expressing more pro-inflammatory cytokines and closely resembling IRCs [31,32].In addition, naive CD4þT-cell loss was also shown to directly result from the differentiation of naive cells into IRC [13], a process driven by inflammation directly related to measures of inflammation [13] with a central role for IL6 in driving such changes and loss of IL6R expression as a result of its signaling [31].Furthermore, IRCs are persisting when inflammation is subclinical due to reduced expression of pro-apoptotic genes [32] and are associated with the occurrence of flares in patients in synthetic-DMARDsinduced remission [33].Most importantly, IRCs remained naive to an antigen challenge (hence expressing CD28) [13,32], and were shown to be recent progeny of naive cells with a high content of T-cell receptor excision circles (TREC) [13].As such, they are not to be confused with terminally differentiated T-effector memory T cells (TEMRA) re-expressing CD45RA, with controversies about high/low levels of expression of CD45RO and expression of CCR7 (reported negative or positive), while more consistently lacking CD62L, CD27 and CD28 expression, and also differences between CD4þ and CD8þ T cells [34][35][36][37] which are antigenexperienced (with low TREC content).Although, the 5-colour panel used here does not include all the markers that would definitely differentiate all the various subsets of memory T cells (notably TEMRA from IRCs), this does not alter the biomarker value of the IRC phenotype identified in this study.Indeed, here, we observed that IRC-CD4 enables segregation of a particular cluster of patients, the majority being rapid progressors, 28 of the 41 (68%) progressors being in Cluster-III.This further supports the hypothesis that imminent progression towards IA may be driven by an event involving or resulting from subclinical inflammation, driving or being associated with the differentiation of naive CD4þT cells into IRC as previously hypothesized [13,17,31].Other CD4þTcell subsets are relevant to RA pathology, notably Th17 cells and Tfh cells [38,39].However, it would have been very limiting to include panels for these in 2017-19 having started this work long before.Alternatively, the addition of a test for the enumeration of Th17 cells using a DNA-methylation specific qPCR [40] is, however, possible and should prove informative using stored whole blood.
Although CD8þT cells share some of the pathways genetically associated with CD4þT cells in RA [41], they have not been reported for their potential biomarker value to our knowledge, with one report presenting an association with arthralgia [42].In preclinical IA, reports, however, suggest that CD8þT cells make up $40% of the total T cells infiltrating the synovium [43,44].Following prolonged/chronic exposure to infectious agents, alterations in homing molecules expressed by CD8þT cells occur [45], resulting in enhanced/ altered migration.Here, we indeed demonstrated a reduced frequency of circulating CD8þT cells, predictive of progression that could be reflecting migration/accumulation into the joint.Alternatively, this reduction could reflect a contraction of the CD8þT-cell pool following an infection (post activation cell death), suggesting a role for subclinical infections as additional environmental risk for RA, as proposed in the past [46].Furthermore, cell death causes the release of inflammatory mediators that might also serve as triggers of cascades of events, particularly netosis [47].In our study, NK-cells CD56 dim were also predictive of progression suggesting that dysregulation of NK cells may have a significant biological role, further fuelling an inflammatory cascade leading to disease progression.Several studies have indeed supported a role for NK cells in RA pathology [48,49] and at the onset of disease in ACPAþ RA and in pre-RA (arthralgia), although this observation was not directly related to progression [50].
The clinical predictors in the models developed in this study were slightly at variance with those from a previously published model [6] probably because different patients and parameters were used (notably 78-TJC rather than hands small TJC and importantly, ACPA-positivity based on a CCP2 test).Model-1 notably only explains 25% of the variance in predicting IA.Modelling using combined data (model-3) showed that six LS and the same four clinical parameters had superior value over model-2 and clearly over model-1 (þ15.7% accuracy).Nonetheless, only 62% of the variance can be explained by model-3, leaving room for additional makers to be added, possibly using imaging and cytokines or epigenetic modification as recently evidenced in early

Figure 1 .
Figure 1.Frequency of lineage and lymphocytes subsets in at-risk progressors vs non-progressors.LS were analysed by flow cytometry and data displayed as violin plots (each dot representing a patient) for Progressor (Prog, n ¼ 93) and non-Progressors (Non-Prog, n ¼ 117).Star (*) indicates LS that were normalised as previously described [15].P-value corrected for multiple testing (MWU test) are indicated when significant and # designate trends

Figure 3 .
Figure 3. Performances of the models.(A) AUROC graphical representation of the logistic regression models.Binary logistic regression models of the occurrence of progression to inflammatory arthritis (IA) were constructed using Model 1 (Clinical data only) for 10 parameters (thin line), Model 2 (flowdata only) for 18 subsets (dotted line) and Model 3 (clinical þ flow data, thick line).Model 1 (AUC ¼ 0.744 95%CI 0.678-0.810)was inferior to Model 2 (AUC ¼ 0.862, 0.814-0.910)and Model 3 still showed added value (AUC ¼ 0.911, 0.871-0.951).(B) Survival curve based on classification using Model 3. Survival plot analysis was performed after patients were dichotomised for high-risk (black line, n ¼ 65/210) and low-risk (grey line, n ¼ 145/210) based on individual probability (>0.80%) calculated from the logistic regression.(C) Variables contribution to Model 6: This showed the relative importance order of the predictors in the model with Treg as the most discriminating biomarker for rapid progression followed by naiveCD4, smoking, CD8, RF, IRC-CD4 and finally NK-CD56 dim cells.(D) Survival curve based on the Cox regression for rapid progression.Survival plot analysis was performed after patients were dichotomised for high-risk (black line, n ¼ 22/158) and low-risk (grey line, n ¼ 136/158) based on individual hazard (>2) calculated from the Cox regression.(E) Overall performance of the prediction model using 1 to 4 flow cytometry panels.Individual participants' probability for progression was dichotomised into high/low-risk groups (based on 80% specificity) in five logistic regression models including the demographic/clinical data only first and then, sequentially adding data from 1, 2, 3 and then 4 flow-cytometry panels.Numbers of patients in both risk groups are displayed against the number of progressors (black bars) and non-progressors (open bar)

Table 1 .
Association of demographic and clinical data with progression (n ¼ 210) Categorical data are presented as n (% of participant).aNumerical data are presented as median (Interquartile range values); MWU Mann-Whitney U and v 2 tests for continues and categorical variables, respectively, were used.
Data are presented as median (interquartile range values).aMWU Mann-Whitney U test adjusted P-value for 18 comparison (adjustment of the P-value was performed by applying Bonferroni correction method for multiple comparison test).b Normalised subsets.AUROC

Table 3 .
Modelling for the predicting of overall and rapid progression